SVM based speaker recognition: harnessing trials with multiple enrollment sessions

Authors

  • Jason W. Pelecanos
  • Weizhong Zhu
  • Sibel Yaman
Abstract

In this paper we extend a variation of the trial-based SVM speaker verification work proposed by Cumani et al. to exploit multiple enrollment sessions. Specifically, Cumani et al. proposed the use of a 2nd order SVM kernel for the binary classification of basic trials. In this new work, trials with multiple enrollment sessions are modelled by stacking the i-vectors of the test and enrollment sessions. We further exploit the fact that the score should be independent of the enrollment recording order and present a simplified 2nd order polynomial kernel scoring function accordingly. In the second part of this work we examine the utility of enrollment pruning for multi-session enrollments. Past work demonstrates that pruning can be beneficial for PLDA based systems. We examine the effects of enrollment pruning in the context of the proposed SVM model. The results demonstrate that the multi-session enrollment SVM kernel is generally better than the model trained using single sessions. The model is also comparable in performance to the PLDA based approach. Further gains are observed through combination of the PLDA and SVM scores.
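The stacking and order-independence ideas in the abstract can be illustrated with a minimal sketch. It is not the paper's scoring function, which the abstract does not spell out. It assumes that a quadratic score over the stacked trial vector becomes order-independent when every enrollment session shares the same parameter blocks. All names here (`stack_trial`, `poly2_kernel`, `tied_quadratic_score`, `random_params`, and the parameter keys) are hypothetical, and the parameters are random placeholders rather than trained SVM weights.

```python
import numpy as np


def stack_trial(test_ivec, enroll_ivecs):
    """Stack the test i-vector and all enrollment i-vectors into one trial vector."""
    return np.concatenate([np.asarray(test_ivec)] + [np.asarray(e) for e in enroll_ivecs])


def poly2_kernel(a, b, offset=1.0):
    """Generic 2nd order polynomial kernel between two stacked trials: (a . b + offset)^2."""
    return (np.dot(a, b) + offset) ** 2


def tied_quadratic_score(test_ivec, enroll_ivecs, params):
    """Quadratic score with parameter blocks tied across enrollment sessions.

    Assumption for illustration only: because every enrollment session uses the
    same blocks, permuting the sessions cannot change the score.
    """
    t = np.asarray(test_ivec)
    E = np.asarray(enroll_ivecs)  # shape: (n_sessions, dim)
    e_sum = E.sum(axis=0)

    score = params["k"]                                   # constant offset
    score += t @ params["G_test"] @ t                     # test/test term
    score += t @ params["L_cross"] @ e_sum                # test/enrollment terms
    # Same-session enrollment terms: sum_i e_i^T G e_i
    score += np.einsum("id,de,ie->", E, params["G_enroll"], E)
    # Cross-session terms: sum_{i != j} e_i^T T e_j
    score += e_sum @ params["T_enroll"] @ e_sum
    score -= np.einsum("id,de,ie->", E, params["T_enroll"], E)
    score += params["c_test"] @ t + params["c_enroll"] @ e_sum  # linear terms
    return float(score)


def random_params(dim, rng):
    """Placeholder parameters; in practice these would come from SVM training."""
    mats = ["G_test", "L_cross", "G_enroll", "T_enroll"]
    params = {name: rng.normal(size=(dim, dim)) for name in mats}
    params["c_test"] = rng.normal(size=dim)
    params["c_enroll"] = rng.normal(size=dim)
    params["k"] = 0.0
    return params
```

Scoring a trial with the enrollment sessions in their original order and then reversed (`enroll_ivecs[::-1]`) should give the same value from `tied_quadratic_score`, whereas `poly2_kernel` applied to two naively stacked trials generally changes when only one trial's enrollment order is permuted.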

Similar resources

Detecting Goats in Speaker Verification Systems

We present a method for detecting goats in a text-dependent speaker verification system using only the enrollment data. The goat detection process is based on extracting an appropriate feature from each enrollment session and ranking all the enrollment sessions in the system according to this feature. The lowest-ranking sessions, which are likely to have a high false-reject rate, are selected. W...

Towards Goat Detection in Text-Dependent Speaker Verification

We present a method that identifies speakers that are likely to have a high false-reject rate in a text-dependent speaker verification system (“goats”). The method normally uses only the enrollment data to perform this task. We begin with extracting an appropriate feature from each enrollment session. We then rank all the enrollment sessions in the system based on this feature. The lowest-ranki...

The NIST SRE summed channel speaker recognition system

This paper presents an improved speaker recognition system for the summed channel evaluation tasks in the 2008 NIST SRE (SRE08) with multiple summed-channel excerpts for speaker training and one summed-channel excerpt for testing. The system includes three main modules in which a hybrid speaker purification and clustering algorithm is adopted to segregate the summed-channel speech, a common spe...

Sparse kernel machines with empirical kernel maps for PLDA speaker verification

Previous studies have demonstrated the benefits of PLDA-SVM scoring with empirical kernel maps for i-vector/PLDA speaker verification. The method not only performs significantly better than the conventional PLDA scoring and utilizes the multiple enrollment utterances of target speakers effectively, but also opens up opportunity for adopting sparse kernel machines in PLDA-based speaker verificat...

A model-based transformational approach to robust speaker recognition

A novel statistical modeling and compensation method for robust speaker recognition is presented. The method specifically addresses the degradation in speaker verification performance due to the mismatch in channels (e.g., telephone handsets) between enrollment and testing sessions. In mismatched conditions, the new approach uses speaker-independent channel transformations to synthesize a spea...

Publication date: 2014